CDS

Accession Number TCMCG024C48335
gbkey CDS
Protein Id XP_022029349.1
Location complement(join(161788981..161789003,161789303..161789415,161789533..161789604,161790199..161790249,161790347..161790434,161790534..161790654,161790746..161790818,161790911..161790974,161792299..161792492,161792611..161792660,161792753..161792816,161792887..161793005))
Gene LOC110930359
GeneID 110930359
Organism Helianthus annuus
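
The Location field above can be checked against the listed protein length. The following is a minimal sketch, assuming the usual 1-based inclusive start..end coordinate convention; the helper name exon_lengths is hypothetical and not part of any database tooling. It sums the segment lengths in the join and derives the residue count after removing one stop codon.

```python
import re

# Location string copied verbatim from the record above.
LOCATION = (
    "complement(join(161788981..161789003,161789303..161789415,"
    "161789533..161789604,161790199..161790249,161790347..161790434,"
    "161790534..161790654,161790746..161790818,161790911..161790974,"
    "161792299..161792492,161792611..161792660,161792753..161792816,"
    "161792887..161793005))"
)


def exon_lengths(location: str) -> list[int]:
    """Return the length of every start..end segment in a location string.

    Hypothetical helper: assumes 1-based, inclusive coordinates,
    so each segment spans end - start + 1 bases.
    """
    spans = re.findall(r"(\d+)\.\.(\d+)", location)
    return [int(end) - int(start) + 1 for start, end in spans]


lengths = exon_lengths(LOCATION)
total_nt = sum(lengths)          # total coding bases, stop codon included
residues = total_nt // 3 - 1     # drop the terminal stop codon

print(len(lengths), total_nt, residues)  # 12 1032 343
```

Under that assumption the twelve segments add up to 1032 bases, that is 344 codons, which is consistent with the 343 aa length given in the Protein section once the stop codon is excluded.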

Protein

Length 343aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA396063
db_source XM_022173657.2
Definition hydroxyproline O-galactosyltransferase HPGT1 [Helianthus annuus]

EGGNOG-MAPPER Annotation

COG_category G
Description Belongs to the glycosyltransferase 31 family
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000
ko01000
ko01003
KEGG_ko ko:K20854
EC -
KEGG_Pathway -
GOs GO:0003674
GO:0003824
GO:0005575
GO:0005622
GO:0005623
GO:0005737
GO:0005768
GO:0005794
GO:0005802
GO:0006029
GO:0006464
GO:0006486
GO:0006493
GO:0006807
GO:0008150
GO:0008152
GO:0008378
GO:0009058
GO:0009059
GO:0009100
GO:0009101
GO:0009987
GO:0010384
GO:0010404
GO:0010405
GO:0012505
GO:0016740
GO:0016757
GO:0016758
GO:0018193
GO:0018208
GO:0018258
GO:0019538
GO:0031410
GO:0031982
GO:0031984
GO:0034645
GO:0036211
GO:0043170
GO:0043226
GO:0043227
GO:0043229
GO:0043231
GO:0043412
GO:0043413
GO:0044036
GO:0044237
GO:0044238
GO:0044249
GO:0044260
GO:0044267
GO:0044422
GO:0044424
GO:0044431
GO:0044444
GO:0044446
GO:0044464
GO:0070085
GO:0071554
GO:0071704
GO:0097708
GO:0098791
GO:1901135
GO:1901137
GO:1901564
GO:1901566
GO:1901576
GO:1990714

Sequence

CDS:  
ATGCGTAGCCGGGGATCCAACAACCGGCTATCCTCTTCTCGCTCAGCCTTTCAATGGCGAATCTCATCTCTCATTCTCTCCATGTTCGCCACCATGGCCGCCTTCTTCGTTGCCTCTCGATTGTGGCAGGAGGCCGAATCTAGGGTTTATTTAGTTAAAGAACTCGATAGAAGAACCGGTCAGGGGGAATCTGCTATATCTGTTGATGACACATTAAAAGTTATAGAATGCAGGGAACAGCGAAAGAGGATGGATATGCTTCAGAAGGAGCTGGATGAAGCTAAGAAGGAAGGGTTTGTTCCGAAGAATCGGTTGGAAAGTAGAGGGGATGGTGAAAAGAAGAAGCTTCTTGCGGTTGTGGGAATTCTTACGGGATTTGGTCGTAGACATAATAGAGATGCGATCCGTAAGGCATGGATGCCTACTGGGACAACTCTAACAAAGCTAGAAGAAGAAAAAGGCATAATCATACGATTTGTTATAGGCAGAAGCTCGAATCATGGAAATAATTCAGACAGTGACATCATCAACGAGAACGAAAGGACAAATGACTTCCTTATTCTTAATGATCACGTGGAGTCGTTAGAACAGCCAATAAAAACCAAGTCGTTCTTTGTTGATGCCTTACAACACTGGGATGCAGAGTTCTATGTAAAGGTCAATGATGACATTTATCTAAATATTGATGCCCTCGGTGCTATTCTTTCAACCCATGTGAACAAGCCTCGGGCCTATATTGGGTGTATGAAATCTGGTGGTGTTTTCTCCAAACCGAGTGACAGATGGTATGAGCCAGAGTGGTGGAAATTTGGGGATAAAAAATCATATTTTCGACATGCTTCCGGGGAAATATTTGCTGTATCTCAAGCTTTGGCTCAGTTTATCTCAATAAACAAGTCAATACTTCGTGCATATGCTCATGATGATGTGAGCGTTGGATCATGGTTCATTGGTCTTGATGTGAAGCATATTGATGAAGGGAAGTTTTGTTGCTCATCTTGGTCTTCAGGGGCAATATGTGCAGCTTCTTGA
Protein:  
MRSRGSNNRLSSSRSAFQWRISSLILSMFATMAAFFVASRLWQEAESRVYLVKELDRRTGQGESAISVDDTLKVIECREQRKRMDMLQKELDEAKKEGFVPKNRLESRGDGEKKKLLAVVGILTGFGRRHNRDAIRKAWMPTGTTLTKLEEEKGIIIRFVIGRSSNHGNNSDSDIINENERTNDFLILNDHVESLEQPIKTKSFFVDALQHWDAEFYVKVNDDIYLNIDALGAILSTHVNKPRAYIGCMKSGGVFSKPSDRWYEPEWWKFGDKKSYFRHASGEIFAVSQALAQFISINKSILRAYAHDDVSVGSWFIGLDVKHIDEGKFCCSSWSSGAICAAS
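
The relationship between the CDS and Protein sequences above can be illustrated with a short translation sketch. It assumes the standard genetic code (the record does not state a translation table), and the function name translate is hypothetical. Only the first 24 and last 12 bases of the CDS are used here, to keep the example short.

```python
from itertools import product

# Standard genetic code (an assumption; the record names no table),
# laid out in TCAG order for first, second and third codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 24 bases of the CDS above give the first 8 residues of the protein.
print(translate("ATGCGTAGCCGGGGATCCAACAAC"))  # MRSRGSNN
# Last 12 bases of the CDS above: three residues followed by the TGA stop.
print(translate("GCAGCTTCTTGA"))  # AAS
```

Both fragments agree with the start (MRSRGSNN) and end (AAS) of the Protein sequence listed in the record.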